A Minimal Encoding Approach to Feature Discovery

Author

  • Mark Derthick
Abstract

This paper discusses unsupervised learning of orthogonal concepts on relational data. Relational predicates, while formally equivalent to the features of the concept-learning literature, are not a good basis for defining concepts. Hence the current task demands a much larger search space than traditional concept learning algorithms, the sort of space explored by connectionist algorithms. However the intended application, using the discovered concepts in the Cyc knowledge base, requires that the concepts be interpretable by a human, an ability not yet realized with connectionist algorithms. Interpretability is aided by including a characterization of simplicity in the evaluation function. For Hinton's Family Relations data, we do find cleaner, more intuitive features. Yet when the solutions are not known in advance, the difficulty of interpreting even features meeting the simplicity criteria calls into question the usefulness of any reformulation algorithm that creates radically new primitives in a knowledge-based setting. At the very least, much more sophisticated explanation tools are needed.

This paper discusses conceptual clustering using the Minimum Description Length principle in a domain of family relationships first solved by Hinton using back-propagation. This problem is unsuitable for algorithms such as ID3 because the features used in the problem description are very far removed from those present in an intuitive theory of family relationships. The algorithm described here, like back-propagation, uses constructive induction to discover a new set of features. The goal is then to use them in rules in the Cyc knowledge base to capture some of the domain regularities. Hence the features must be interpretable by a human. This requires maintaining the power of connectionist systems to discover completely new features while enhancing interpretability, which is accomplished by including a characterization of feature simplicity in the feature evaluation function. Indeed no other algorithm has discovered such clean intuitive features for problems, like Family Relations, in which the original data is not naturally represented as feature-vectors. Yet this success has pointed out the difficulty of interpreting even very good features, and calls into question the usefulness of any representation discovery algorithm that creates radically new primitives in a knowledge-based setting. At the very least, much more sophisticated explanation tools are needed.

This is an abridged version of ACT-CYC-234-90, "The Minimum Description Length Principle Applied to Feature Learning and Analogical Mapping."
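
The trade-off the abstract describes, scoring a candidate feature by both its simplicity and how well it accounts for the data, can be illustrated with a generic two-part description-length score. The sketch below is not the paper's evaluation function: the names `entropy_bits` and `description_length`, the `feature_complexity_bits` parameter, and the toy values are all assumptions made for illustration.

```python
import math
from collections import Counter


def entropy_bits(labels):
    """Bits needed to encode `labels` with an ideal code built from
    their own empirical distribution (n times the Shannon entropy)."""
    n = len(labels)
    counts = Counter(labels)
    return -sum(c * math.log2(c / n) for c in counts.values())


def description_length(feature_values, targets, feature_complexity_bits):
    """Two-part cost of a candidate feature (hypothetical scoring):
    bits charged for stating the feature itself, plus bits needed to
    encode the targets once they are grouped by the feature's value."""
    groups = {}
    for value, target in zip(feature_values, targets):
        groups.setdefault(value, []).append(target)
    data_bits = sum(entropy_bits(group) for group in groups.values())
    return feature_complexity_bits + data_bits


# Toy, made-up data: four items with a binary property to be explained.
targets = [0, 0, 1, 1]

# A feature that separates the targets exactly but costs more to state.
informative = description_length([0, 0, 1, 1], targets, feature_complexity_bits=2)

# A cheaper feature that tells us nothing about the targets.
uninformative = description_length([0, 1, 0, 1], targets, feature_complexity_bits=1)
```

Under this scoring the lower total wins, so a slightly more complex feature is preferred only when it shortens the encoding of the data by more than its own extra cost.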

Similar resources

Fuzzy-rough Information Gain Ratio Approach to Filter-wrapper Feature Selection

Feature selection for various applications has been carried out for many years in many different research areas. However, there is a trade-off between finding feature subsets with minimum length and increasing the classification accuracy. In this paper, a filter-wrapper feature selection approach based on fuzzy-rough gain ratio is proposed to tackle this problem. As a search strategy, a modifie...

Receptive Field Encoding Model for Dynamic Natural Vision

Introduction: Encoding models are used to predict human brain activity in response to sensory stimuli. The purpose of these models is to explain how sensory information is represented in the brain. Convolutional neural networks trained on images are capable of encoding magnetic resonance imaging data of humans viewing natural images. Considering the hemodynamic response function, these networks are ...

Translation-Based Revision and Merging for Minimal Horn Reasoning

In this paper we introduce a new approach for revising and merging consistent Horn formulae under minimal model semantics. Our approach is translation-based in the following sense: we generate a propositional encoding capturing both the syntax of the original Horn formulae (the clauses which appear or not in them) and their semantics (their minimal models). We can then use any classical revisio...

Semantic Feature Analysis Treatment for Anomia of Two Nonfluent Persian-Speaking Aphasic Patients

Objectives: Semantic Feature Analysis was designed to improve lexical retrieval of aphasic patients via activation of semantic networks of the words. In this approach, the anomic patients are cured with semantic information to assist oral naming. The purpose of this study was to examine the effects of Semantic Feature Analysis treatment on anomia of two nonfluent aphasic patients. Methods: A...

Knowledge discovery using neural approach for SME's credit risk analysis problem in Turkey

This study proposes a knowledge discovery method that uses multilayer perceptron (MLP) based neural rule extraction (NRE) approach for credit risk analysis (CRA) of real-life small and medium enterprises (SMEs) in Turkey. A feature selection and extraction stage is followed by neural classification that produces accurate rule sets. In the first stage, the feature selection is achieved by decisi...

A Comparison of the Effects of Three Teaching-Learning Approaches on Students' Learning Performance in Biology

The present study was designed to investigate the effects of three teaching-learning approaches, namely discovery, interactive and transmission approaches, on students' learning performance in biology lessons. In this quasi-experimental research, three experimental groups (N1=60, N2=71, N3=63) were used in order to identify any significant difference between the students' learning performance wh...

Publication date: 1991